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Abstract. A central computational problem for analyzing and model 
checking various classes of infinite-state recursive probabilistic systems 
(including quasi-birth-death processes, multi-type branching processes, 
stochastic context-free grammars, probabilistic pushdown automata and 
recursive Markov chains) is the computation of termination probabilities, 
and computing these probabilities in turn boils down to computing the 
least fixed point (LFP) solution of a corresponding monotone polynomial 
system (MPS) of equations, denoted x = P{x). 

It was shown by Etessami and Yannakakis that a decomposed variant 
of Newton's method converges monotonically to the LFP solution for any 
MPS that has a non-negative solution. Subsequently, Esparza, Kiefer, 
and Luttenberger [7] obtained upper bounds on the convergence rate 
of Newton's method for certain classes of MPSs. More recently, better 
upper bounds have been obtained for special classes of MPSs f|10l 15]). 
However, prior to this paper, for arbitrary (not necessarily strongly- 
connected) MPSs, no upper bounds at all were known on the conver- 
gence rate of Newton's method as a function of the encoding size |P| of 
the input MPS, x = P{x). 

In this paper we provide worst-case upper bounds, as a function of both 
the input encoding size |P|, and e > 0, on the number of iterations 
required for decomposed Newton's method {even with rounding) to con- 
verge to within additive error e > of g*, for an arbitrary MPS with 
LFP solution q* . Our upper bounds are essentially optimal in terms of 
several important parameters of the problem. 

Using our upper bounds, and building on prior work, we obtain the 
first P-time algorithm (in the standard Turing model of computation) 
for quantitative model checking, to within arbitrary desired precision, of 
discrete-time QBDs and (equivalently) probabilistic 1-counter automata, 
with respect to any (fixed) tj-regular or LTL property. 



* A full version of this paper is available at arxiv.org/abs/1302.3741. Research par- 
tially supported by NSF Grant CCF-1017955. 



1 Introduction 



In recent years, there has been extensive work on the analysis of various classes of 
infinite-state recursive probabilistic systems, including recursive Markov chains, 
probabilistic pushdown systems, stochastic context-free grammars, multi-type 
branching processes, quasi-birth-death processes and probabilistic 1-counter au- 
tomata (e.g. [TTJ [T^ [HI [HI Uni SI)- These are all finitely-presentable models that 
specify an infinite-state underlying probabilistic system. These classes of sys- 
tems arise in a variety of fields and have been studied by various communi- 
ties. Recursive Markov chains (RMC), and the equivalent model of probabilistic 
pushdown systems (pPDS), are natural models for probabilistic programs with 
recursive procedures fTTifS]. Quasi-birth-death (QBD) processes, which are essen- 
tially equivalent (in discrete-time) to probabilistic 1-counter automata (plCA), 
are used in queueing theory and performance evaluation [501 HH]- Stochastic 
context-free grammars are a central model in natural language processing and 
are used also in biology ^ , and branching processes are a classical probabilistic 
model with many applications, including in population genetics (|14|). 

A central problem for the analysis and model checking of these systems is 
the computation of their associated termination probabilities. Computing these 
probabilities amounts to solving a system of fixed-point multivariate equations 
X = P{x), where a; is a (finite) vector of variables and P is a vector of polyno- 
mials with positive coefficients; such a system of equations is called a monotone 
polynomial system (MPS) because P defines a monotone operator from the non- 
negative orthant to itself. Each of the above classes has the property that, given 
a model M in the class, we can construct in polynomial time a corresponding 
MPS X — P{x) such that the termination probabilities of M (for various initial 
states) are the least fixed point (LFP) solution of the system, i.e., they satisfy 
the system, and any other nonnegative solution is at least as large in every coor- 
dinate. In general, a monotone polynomial system may not have any fixed point; 
consider for example x = x -\- I. However, if it has a fixed point, then it has a 
least fixed point (LFP). The systems constructed from probabilistic systems as 
above always have a LFP. which has values in [0, 1] since its coordinates give the 
termination probabilities. 

The equations are in general nonlinear, and their LFP solution (the vector of 
termination probabilities) is in general irrational even when all the coefficients of 
the polynomials (and the numerical input data of the given probabilistic model) 
are rational. Hence we seek to compute the desired quantities up to a desired 
accuracy e > 0. The goal is to compute them as efficiently as possible, as a 
function of the encoding size of the input (the given probabilistic model, or the 
MPS) and the accuracy e. We first review some of the relevant previous work 
and then describe our results. 

Previous Work. An algorithm for computing the LFP of MPSs, based on 
Newton's method, was proposed in [IT]. Given a MPS, we can first identify in 
polynomial time the variables that have value in the LFP and remove them 
from the system, yielding a new so-called cleaned system. Then a dependency 
graph between the variables is constructed, the variables and the MPS are de- 



composed into strongly connected components (SCCs), and Newton's method is 
applied bottom-up on the SCCs, starting from the all-0 vector. It was shown in 
[TT] that, for any MPS that has a (nonnegative) solution, the decomposed variant 
of Newton's method converges monotonically to the LFP. Optimized variants of 
decomposed Newton's method have by now been implemented in several tools 
(see, e.g., [221 119| ). and they perform quite well in practice on many instances. 

Esparza, Kiefer and Luttenberger studied in detail the rate of convergence of 
Newton's method on MPSs |7j (with or without decomposition). On the negative 
side, they showed that there are instances of MPSs x — P{x) (in fact even simple 
RMCs), with n variables, where it takes an exponential number of iterations in 
the input size to get even within just one bit of precision (i.e. accuracy 1/2). On 
the positive side, they showed that after some initial number fcp of iterations 
in a first phase, Newton's method thereafter gains bits of precision at a linear 
rate, meaning that kp + cp ■ i iterations suffice to gain i bits of precision, where 
both kp and cp depend on the input, x = P{x). For strongly connected MPSs, 
they showed that the length, fcp, of the initial phase is upper bounded by an 
exponential function of the input size |P|, and that cp — 1. For general MPSs 
that are not strongly connected (and for general RMCs and pPDSs) , they showed 
that Cp = n2" suflaces, but they provided no upper bound at all on kp (and none 
was known prior to the present paper). Thus, they obtained no upper bounds, 
as a function of the size of the input, x — P{x), for the number of iterations 
required to get to within even the first bit of precision (e.g., to estimate within 
< 1/2 the termination probability of a KMC) for general MPSs and RMCs. 
Proving such a general bound was left as an open problem in [7]. 

For special classes of probabilistic models (and MPSs) better results are now 
known. For the class of quasi-birth-death processes (QBDs) and the equivalent 
class of probabilistic 1-counter automata (plCA), it was shown in [lOj that the 
decomposed Newton method converges in a polynomial number of iterations in 
the size of the input and the bits of precision, and hence the desired termination 
probabilities of a given plCA M can be computed within absolute error e = 
2~' in a number of arithmetic operations that is polynomial in the size \M\ 
of the input and the number i — log(l/e) of bits of precision. Note that this 
is not polynomial time in the standard Turing model of complexity, because 
the numbers that result from the arithmetic operations in general can become 
exponentially long (consider n successive squarings of a number). Thus, the 
result of [10 shows that the termination problem for plCAs can be solved in 
polynomial time in the unit-cost exact rational arithmetic model, a model in 
which arithmetic operations cost 1 time unit, regardless of how long the numbers 
are. It is not known exactly how powerful the unit-cost rational model is, but 
it is believed to be strictly more powerful than the ordinary Turing model. The 
question whether the termination probabilities of a plCA (and a QBD) can be 
computed in polynomial time (in the standard model) was left open in [lOj. 

Building on the results of [TU] for computation of termination probabilities 
of plCAs, more recently Brazdil, Kiefer and Kucera |4] showed how to do quan- 
titative model checking of w-regular properties (given by a deterministic Rabin 



automaton) for plCAs, i.e., compute within desired precision e > the proba- 
bility that a run of a given plCA, M, is accepted by a given deterministic Rabin 
automaton, R, in time polynomial in M, i?, log(l/e) in the unit-cost rational 
arithmetic model. The complexity in the standard Turing model was left open. 

For the classes of stochastic context-free grammars, multi-type branching 
processes, and the related class of 1-exit RMCs, we showed recently in [5] that 
termination probabilities can be computed to within precision e in polynomial 
time in the size of the input model and log(l/e) (i.e. the # of bits of precision) in 
the standard Turing model [5]. The algorithm is a variant of Newton's method, 
where the preprocessing identifies and eliminates (in P-time [llj) the variables 
that have value 1 in the LFP (besides the ones with value 0). Importantly, the 
numbers throughout the computation are not allowed to grow exponentially in 
length, but are always rounded down to a polynomial number of bits. The analy- 
sis then shows that the rounded Newton's algorithm still converges to the correct 
values (the LFP) and the number of iterations and the entire time complexity 
is polynomially bounded. 

For general RMCs (and pPDSs) and furthermore for general MPSs, even if 
the LFP is in [0,1]", there are negative results indicating that it is probably 
impossible to compute the termination probabilities and the LFP in polynomial 
time in the standard Turing model. In particular, we showed in that approxi- 
mating the termination probability of a RMC within any constant additive error 
< ^, is at least as hard as the square-root-sum problem, a longstanding open 
problem that arises often in computational geometry, which is not even known 
to be in NP, and that it is also as hard as the more powerful problem, called 
PosSLP pLj, which captures the essence of unit-cost rational arithmetic. Thus, if 
one can approximate the termination probability of a RMC in polynomial time 
then it is possible to simulate unit-cost rational arithmetic in polynomial time 
in the standard model, something which is highly unlikely. 

As we mentioned at the beginning, computing termination probabilities is 
a key ingredient for performing other, more general analyses, including model 
checking [12118]. 

Our Results. We provide a thorough analysis of decomposed Newton's 
method and show upper bounds on its rate of convergence as a function of the 
input size and the desired precision, which holds for arbitrary monotone polyno- 
mial systems. Furthermore, we analyze a rounded version of the algorithm where 
the results along the way are not computed exactly to arbitrary precision but are 
rounded to a suitable number of bits (proportional to the number of iterations 
k of Newton's method that are performed), while ensuring that the algorithm 
stays well-defined and converges to the LFP. Thus, the bounds we show hold for 
the standard Turing model and not only the unit-cost model. Note that all the 
previous results on Newton's method that we mentioned, except for |9], assume 
that the computations are carried out in exact arithmetic. To carry out k itera- 
tions of Newton's method with exact arithmetic can require exponentially many 
bits, as a function of k, to represent the iterates. In general, the fact that New- 
ton's method converges with exact arithmetic does not even imply automatically 



that rounded Newton iterations will get anywhere close to the solution when we 
round to, say, only polynomially many bits of precision as a function of the num- 
ber of iterations fc, let alone that the same bounds on the convergence rate will 
continue to hold. We nevertheless show that suitable rounding works for MPSs. 

In more detail, suppose that the given (cleaned) MPS x ~ P{x) has a LFP 
g* > 0. The decomposition into strongly connected components yields a DAG of 
SCCs with depth c?, and we wish to compute the LFP with (absolute) error at 
most e. Let gj^j^^ and gj^ax be the minimum and maximum coordinate of q* . Then 
the rounded decomposed Newton method will converge to a vector q within e 
of the LFP, i.e., such that \\q* — q\\oo < e in time polynomial in the size |P| of 
the input, log(l/e), log(l/(7*jj„), \og[q^^^^ and 2'^ (the depth d in the exponent 
can be replaced by the maximum number of nonlinear SCCs in any path of the 
DAG of SCCs). We also obtain bounds on g*^;^^ and g^^x terms of \P\ and the 
number of variables n, so the overall time needed is polynomial in \P\, 2" and 
log(l/e). We provide actually concrete expressions on the number of iterations 
and the number of bits needed. As we shall explain, the bounds are essentially 
optimal in terms of several parameters. The analysis is quite involved and builds 
on the previous work. It uses several results and techniques from [71 [9] , and 
develops substantial additional machinery. 

We apply our results then to probabilistic 1-counter automata (plCAs). Us- 
ing our analysis for the rounded decomposed Newton method and properties 
of plCAs from [10], we show that termination probabilities of a plCA M (and 
QBDs) can be computed to desired precision e in polynomial time in the size \M\ 
of the plCA and log(l/e) (the bits of precision) in the standard Turing model 
of computation, thus solving the open problem of |10j . 

Furthermore, combining with the results of [4] and [12], we show that one can 
do quantitative model checking of w-regular properties for plCAs in polynomial 
time in the standard Turing model, i.e., we can compute to desired precision e 
the probability that a run of a given plCA M satisfies an cj-regular property 
in time polynomial in \M\ and log(l/e) (and exponential in the property if it is 
given for example as a non-deterministic Biichi automaton or polynomial if it is 
given as a deterministic Rabin automaton). 

The rest of the paper is organized as follows. In Section 2 we give basic 
definitions and background. In Section 3 we consider strongly-connected MPS, 
and in Section 4 general MPS. Section 5 analyzes plCAs. Most proofs are in the 
appendix. 

2 Definitions and Background 

We first recall basic definitions about MPSs from [TTj. A monotone ■polynomial 
system of equations (MPS) consists of a system of n equations in n variables, 
X = (xi, . . . , Xn)^ the equations are of the form Xi = Pi{x), i — 1, . . . , n, such 
that Pi(x) is a multivariate polynomial in the variables x, and such that the 
monomial coefficients and constant term of Pi{x) are all non-negative. More 
precisely, for a = (ai, a2, . . . , a„) G N", we use the notation a;" to denote the 



monomial x'^^x'^^ . . .a;""- (Note that by definition ' = 1.) Then for each 
polynomial Pi{x), i — 1, . . . ,n, there is some finite subset of N", denoted Ci, 
and for each a € Ci, there is a positive (rational) coefficient a^a > 0, such that 

For computational purposes, we assume each polynomial Pi{x) has rational 
coefficientijfl, and that it is encoded succinctly by specifying the list of pairs 
((ci,a, q;) I a e Ci), where each rational coefficient Ci^a is represented by giving its 
numerator and denominator in binary, and each integer vector a is represented in 
sparse representation, by only listing its non-zero coordinates, ii, . . . , i^, by using 
a list ((ji,a)ij), . . . , (ik,ai^,)), giving each integer in binary. (Proposition [1] 
below, from PUIS], shows that using such a sparse representation does not entail 
any extra computational cost.) 

We use vector notation, using x — P{x) to denote the entire MPS. We use 
\P\ to denote the encoding size (in bits) of the MPS x — P{x) having rational 
coefficients, using the succinct representation just described. 

Let R>o denote the non-negative real numbers. Then P{x) defines a mono- 
tone operator on the non-negative orthant M>o- In other words, P : M"q — > K>o, 
and if < a < 6, then P(a) < P{b). In general, an MPS need not have any real- 
valued solution: consider x ~ x + 1. However, because of monotonicity of P{x), 
if there exists a solution a E K"q such that a — P{a), then there exists a least 
fixed point (LFP) solution q* S R"g such that q* — P{q*), and such that q* < a 
for all solutions a £ M>o- Indeed, if for z £ R" we define P°{z) = z, and define 
pfc+i^^^ = P[P^[z)), for all fc > 0, then (as shown in [111) value iteration start- 
ing at the all-0 vector converges monotonically to q*: in other words Vfc > 
pfc(O) < P'=+i(0), and limfc^oo ^''(0) = 9*0 

Unfortunately, standard value iteration P'"'(0), fc — ?> cxd, can converge very 
slowly to g*, even for a fixed MPS with 1 variable, even when q* — 1\ specifi- 
cally, X ~ (l/2)a;^ + 1/2 already exhibits exponentially slow convergence to its 
LFP q* — \ (iJLLj)- It was shown in [TT] that a decomposed variant of Newton's 
method also converges monotonically to q* for an MPS with LFP solution q* . 
More recently, in [3] , a version of Newton's method with suitable rounding be- 

^ although we also reason about MPSs with positive real-valued coefficients in our 
proofs. 

* Indeed, even if an MPS does not have a p,nite LFP solution q* £ K>o, it always does 
have an LFP solution over the extended non-negative reals. Namely, we can define the 
LFP of any MPS, x = P{x), to be the vector q* £ l>o over l>o = (R>o U {-l-oo}), 
given by q* :— Vanu^oc P'°{0)- In general, it is PosSLP-hard to decide whether a 
given MPS has a finite LFP. (This follows easily from results in [11], although it is 
not stated there: is was shown there it is PosSLP-hard to decide if gi > 1 in an MPS 
with finite LFP q* £ K>o. Then just add a variable xq, and an equation xq — x^xi + l 
to the MPS. In the new MPS, q^ = +oo if and only if ql > 1.) However, various 
classes of MPSs, including those whose LFP corresponds to termination probabilities 
of various recursive probabilistic systems do have a finite LFP. Thus in this paper 
we will only consider LFP computation for MPSs that have a finite LFP q* £ R>o. 
So when we say "x = P{x) is an MPS with LFP solution g*", we mean q* £ R>o, 
unless specified otherwise. 



tween iterations was studied. Rounding is necessary if one wishes to consider the 
complexity of Newton's method in the standard (Turing) model of computation, 
which does not allow unit-cost arithmetic operations on arbitrarily large num- 
bers. In this paper we will apply a version of Newton's method to MPSs which 
uses both rounding and decomposition. Before describing it, we need some fur- 
ther background. 

An MPS, X — P{x), is said to be in simple normal form (SNF) if for every 
i = 1, . . . , n, the polynomial Pi{x) has one of two forms: (1) Form*: Pi{x) = XjXk 
is simply a quadratic monomial; or (2) Form+: Pi{x) is a linear expression 
X^jgc Pi-j^j +Pi,0: for some rational non- negative coefficients pi^j and Pi^^ and 
some index set Ci C {1, . . . , n}. In particular, in any MPS in SNF form every 
polynomial Pi{x) has multivariate degree bounded by at most 2 in the variables 
X. We will call such MPSs quadratic MPSs. 

As shown in [11, 9J, it is easy to convert any MPS to SNF form, by adding 
auxiliary variables and equations: 

Proposition 1. (Propos. 7.3 fTTj, and Propos. 2.1 of Every MPS, x = 
P{x), with LFP q* , can he transformed in P-time to an "equivalent" quadratic 
MPS y — Q{y) in SNF form, such that \Q\ £ 0{\P\). More precisely, the vari- 
ables X are a subset of the variables y, and y = Q{y) has LFP p* iff x — P{x) 
has LFP q* , and projecting p* onto the x variables yields q* . 

Furthermore, for any MPS, x — P{x), we can in P-time find and remove any 
variables Xi, such that the LFP solution has q* =0.^ 

Proposition 2. (Proposition 7.4 of fTTj/) There is a P-time algorithm that, 
given any MP^, X = P{x), over n variables, determines for each i €z {1, . . . ,n} 
whether q* =0. 

Thus, for every MPS, we can detect in P-time all the variables Xj such that 
q* = 0, remove their equation Xj = Pj{x), and set the variable Xj to on 
the RHS of the remaining equations. We obtain as a result a cleaned MPS, 
x' = Q{x'), which has an LFP q* > 0. 

Applying Propositions[l]and[2l we assume wlog in the rest of this paper 
that every MPS is a cleaned quadratic MPS, with LFP q* > 

In order to describe decomposed Newton's method, for a cleaned MPS, x = 
P{x) we need to define the dependency graph, Gp = {V,E), of the MPS. The 
nodes of Gp are the remaining variables Xi, and the edges are defined as 
follows: [xi,Xj) € E if and only if Xj appears in some monomial in Pi{x) that 
has a positive coefficient. 

We shall decompose the cleaned system of equation x — P{x), into strongly 
connected components (SCCs), using the dependency graph Gp of variables, and 
we shall apply Newton's method separately on each SCC "bottom-up". 

^ This proposition holds regardless whether the LFP q* is finite or is over the extended 
non-negative reals, R>o. Such an extended LFP exists for any MPS. See footnote [D 

® For compatibility when quoting prior work, it will sometimes be convenient to assume 
quadratic MPSs, rather than the more restricted SNF form MPSs. 



We first recall basic definitions for (a rounded down version of) Newton's 
method applied to MPSs. For an MPS, x — P{x), with n variables, we define 
B{x) = P'{x) to be the n x n Jacobian matrix of partial derivatives of P{x). In 
other words, B(x)ij = ^^f^- For a vector z e M", assuming that the matrix 
(/ — B{z)) is non-singular, a single iteration of Newton's method (NM) on x = 
P{x) at z is defined via the following operator: 

Mp{z) := z + {I - B{z))-\P{z) ^ z) (1) 

Let us now recall from [9^ the rounded down Newton's method, with parameter 
h, applied to an MPS: 

Definition 1. Rounded-down Newton's method (H-NMj , with rounding 
parameter h.) Given an MPS, x = P{x), with LFP q* , where < q* , in the 
rounded down Newton's method (R-NM) with integer rounding parameter h > 0, 
we compute a sequence of iteration vectors x^^^ , where the initial starting vector 
is x^^^ := 0, and such that for each k > 0, given x^''\ we compute x^'^^'^^ as 
follows: 

1. First, compute 2;'^*'+"'^^ :— Afp{x^^^), where the Newton iteration operator 
Mp{x) was defined in equation (Qp. (Of course we need to show that all such 

Newton iterations are defined.) \k+i] 

2. For each coordinate i — 1, . . . ,n, set x\ to he equal to the maximum (non- 
negative) multiple 0/2^'' which is < max(x|'^^^^, 0). (In other words, round 
down cc^*'"'"^^ to the nearest multiple of2^^, while making sure that the result 
is non-negative.) 

Now we describe the Rounded- down Decomposed Newton's Method 

(R-DNM) applied to an MPS, x = P{x), with real-valued LFP q* > 0. Firstly, 
we use Proposition [2] to remove variables, and thus we can assume we are given 
a cleaned MPS, x = P{x), with real- valued LFP q* > 0. 

Let Hp be the DAG of SCO's of the dependency graph Gp. We work bottom- 
up in Hp, starting at bottom SCCs. For each SCC, S, suppose its corresponding 
equations are xs = Ps{xs, xpx^g)), where D{S) denotes the union of the variables 
in "lower" SCCs, below S, on which S depends. In other words, a variable Xj G 
D{S) iff there is some variable Xi G S such that there is directed path in Gp 
from Xi to Xj. If the system xs = (2^5, 9^(5)) is a linear system (in xg), we 
call S a linear SCC, otherwise S* is a nonlinear SCC. Assume we have already 
calculated (using R-DNM) an approximation qD{S) to the LFP solution q'^rg-^ 
for these lower SCCs. We plug in qD{s) into the equations for S, obtaining the 
equation system xg — Ps{xg, qpi{s))- We denote the actual LFP solution of this 
new equation system by q'g. (Note that q'g is not necessarily equal to qg, because 
Qd{S) is only an approximation of q'^(^gy) 

If S* is a nonlinear SCC, we apply a chosen number g of iterations of R-NM 
on the system xg = Psixs-:QD{S)) to obtain an approximation qg of q'g; if S is 
linear then we just apply 1 iteration of R-NM, i.e., we solve the linear system and 
round down the solution. We of course want to make sure our approximations 



are such that — qs||oo < e, for all SCCs S, and for the desired additive error 
e > 0. We shall establish upper bounds on the number of iterations g, and on the 
rounding parameter h, needed in R-DNM for this to hold, as a function of various 
parameters: the input size \P\ and the number n of variables; the nonlinear depth 
f of P, which is defined as the maximum, over all paths of the DAG Hp of SCCs, 
of the number of nonlinear SCCs on the path; and the maximum and minimum 
coordinates of the LFP. 

Bounds on the size of LFPs for an MPS. For a positive vector u > 0, we 
use Umin = min^ Vi to denote its minimum coordinate, and we use Umax = max^ Vi 
to denote its maximum coordinate. Slightly overloading notation, for an MPS, 
X = P{x) , we shall use c,nin to denote the minimum value of all positive monomial 
coefficients and all positive constant terms in P{x). Note that Cmin also serves 
as a lower bound for all positive constants and coefficients for entries of the 
Jacobian matrix B{x), since B{x)ij — ^^f^- 

We prove the following Theorem in the appendix, establishing bounds on the 
maximum and minimum coordinates of the LFP q* of an MPS x — P{x). 

Theorem 1. If x = P{x) is a quadratic MPS in n variables, with LFP q* > 0, 
and where P{x) has rational coefficients and total encoding size \P\ bits, then 

1. <z*,i„>2-l^l(2"-i), and 

2 q* < 22("+l)(l-P|+2("+l)log(2«+2))-5" 

How good are our upper bounds? In the appendix we discuss how good 
our upper bounds on R-DNM are, and in what senses they are optimal, in light 
of the convergence rate of Newton's method on known bad examples ([7]), and 
considerations relating to the size of 5*^;^^ and g^ax- this way, our upper bounds 
can be seen to be essentially optimal in several parameters, including the depth 
of SCCs in the dependency graph of the MPS, and in terms of log i. 



3 Strongly Connected Monotone Polynomial Systems 

Theorem 2. Let P{x,y) be an n-vector of monotone polynomials with degree 
< 2 in variables which are coordinates of the n-vector x and the m-vector y, 
where n > 1 and m > 1 . 

Given non-negative m-vectors yi and y2 such that < yi < 1 and < j/2 ^ 
2/1, let Pi{x) = P{x,yi) and P2{x) = P{x,y2). Suppose that x — Pi{x) is a 
strongly- connected MPS with LFP solution < < 1. 

Let a — min{l, c„iin}iiiin{?/„iin, ^9min}' where Cmin 'is the smallest non-zero con- 
stant or coefficient of any monomial in P(x,y), where j/,nin is the minimum 
coordinate of yi , and finally where q*^i^ is the minimum coordinate of ql . Then: 

1. The LFP solution of the MPS x — P2{x) is (72 with < (72 ^ ^i; '"^'^ 
Iki* - 92II00 < ^4na-(3"+i)||P(J,J)|U||yi-y2||oo 



Furthermore, if x — Pi {x) is a linear system, then: 

H - 92II00 < 2na-(«+2)||p(i, i)||oo||yi - y2||oo 

2. Moreover, for every < e < 1, if we use g > h ~ 1 iterations of rounded 
down Newton's method with parameter 

h> [2 + 71 log - + log -1 
a e 

applied to the MPS, x = P2{x), starting at x^'^^ :— 0, to approximate then 
the iterations are all defined, and \\q2 — a;[^l||oo < £• 

Theorem [5] and its proof are at the heart of this paper, but unfortunately the 
proof is quite involved, and we have no room to include it. The proof is in the 
appendix. The following easy corollary is also proved in |2Tj . 

Corollary 1. Let x — P{x) be a strongly connected MPS with n variables, and 
with LFP q* where < q* < 1. Let a = min{l, Cminj^'Zmin' where Cmin is the 
smallest non-zero constant or coefficient of any monomial in P{x). 

Then for all < e < 1, if we use g > h — 1 iterations of R-NM with parameter 
h > [2 + nlogi + log i] applied to the MPS, x = P{x), starting at a;[°] := 0, 
then the iterations are all defined, and \\q* — a;[^l||oo < e- 



4 General Monotone Polynomial Systems 

In this section, we use the rounded-down decomposed Newton's method (R- 
DNM), to compute the LFP q* of general MPSs. First we consider the case 
where < q* < 1: 

Theorem 3. For all e, where 0<e<l, if x — P{x) is an MPS with LFP 
solution < q* <1, with q'^i^ = min^g*, and the minimum non-zero coefficient 
or constant in P{x) is Cmin; then rounded down decomposed Newton's method 
(R-DNM) with parameter 



h> 



3 + 2^ • ( log(i) + d ■ (log(a-(4«+i)) + iog(i6n) + log(||P(l)||„o)) ) 



using g > h — 1 iterations for every nonlinear SCC ( and 1 iteration for linear 
sec), gives an approximation q to q* with q < q* and such that \\q* — q\\oo ^ £• 
Here d denotes the maximum depth of SCCs in the DAG Hp of SCCs of the 
MPS X = P{x), f is the nonlinear depth, and a = min{l, c,jii,i} • ■^q*^^^- 

Before proving the theorem, let us note that we can obtain worst-case ex- 
pressions for the needed number of iterations g — h — 1, and the needed round- 
ing parameter h, in terms of only f < d < n < \P\, and e, by noting that 
log(||P(l)||oo) < 1^*1, and by appealing to Theorem[T]to remove references to q^^^ 



in the bounds. Noting that Cmin > 2 l-^l, these teh us that min{l, Cmin}^q^i^ > 
2-|P|2"-i_ Substituting, we obtain that any: 



> 



2 + 2^ • ( log(i) + d ■ (|P|2"(4n + 1) + (4n + 1) + log(16n) + ) 



(2) 



iterations suffice in the worst case, with rounding parameter h — g + 1. Thus, 
for i — log(l/e) bits of precision, g ^ kp + cp ■ i iterations suffice, where cp — 2^ 
and kp — 0(2^2"nc?|P|), with tame constants in the big-0. 

Proof (of Theorem\3\). For every SCC S, its height hs (resp. nonlinear height 
fs) is the maximum over all paths of the DAG Hp of SCCs starting at S, of 
the number of SCCs (resp. nonlinear SCCs) on the path. We show by induction 
on the height hs of each SCC S that hg - qsWoo < P^^'S'^'''' where /3 = 
16na-(3"+i)||p(l)||^ and 6 = [fif^- Note that since n > 1, e < 1, and a < 

Cmin, we have /3 > 1 and (5 < 1, and thus also 5 < VS. 

Let us first check that this would imply the theorem. For all SCCs, S, we 
have 1 < /is < d and < /s < /, and thus \\q*g - qs\\r>o < IS'^^S^'^'' < fi'^S"^'^ = 

We note that h is related to 6 by the following: 

1 2 

> 2 + nlog- +log- (3) 
a 

This is because log f = 1 + log i = 1 + 2-^ (log ^ + dlog/3) ^ 1 + 2f (log(i) + 
dlog(16na~'^"+^||P(l)||oo))- Note that (3) implies that this inequality holds also 
for any subsystem of a; = P{x) induced by a SCC S and its successors D{S) 
because the parameters n and 1 /a for a subsystem are no larger than those for 
the whole system. 

We now prove by induction on hs that — gs||cx) < . 

In the base case, /15 = 1, we have a strongly connected MPS xs = Ps{x)- If 
S is linear, we solve the linear system exactly and then round down to a multiple 
of 2-^. Then fs = 0, and we have to show \\q% - qs\\oc < ^'"'5^''^ = 136. But 



ks-9s||oo<2-''<|</?J 



h 

For the base case where S in non-linear, equation [3] and Corollary [1] imply 
that — gslloo < f , which implies the claim since 6 < 1 and /? > 1, hence 

Inductively, consider an SCC S with /15 > 1. Then S depends only on SCCs 
with height at most hs — 1. If S* is linear, it depends on SCCs of nonlinear depth 
at most fD(s) — fs, whereas if S is non-linear, it depends on SCCs of nonlinear 
depth at most fo^s) = fs — ^- We can assume by inductive hypothesis that 

\\<lh{s)-ms)\\oo < /3''^-M2"'"<^'.Takeg^ tobetheLFP ofxs = Ps{xs,qD(S))- 
Suppose 2:5 — Ps{xs, qpK^s)) linear in xs- Then Theorem[5]with yi := (7^(5) 
and 2/2 := yields 

hs - q'sWoo < 2nsa-("-+^)||P(l, l)||oo||gI,(5) " to(5)l|oo 



But 2nsa-("--+2)||F(l,l)||oo < f, so - g^||oo < §\\q*DiS) " to(S)l|oo < 

follows that llgj - qs\\oo < P^^'S^''^ 

Suppose that 2:5 — Pg{xg, 9^(5-)) is non-linear in 2:5. Theorem[5J with yi := 
^*D{S) ^^'^ y2 := to(s): yields that 

\\q*s ~ q'sWoo < ^4na-(3"+i)||P(l)||o,||q^(^)-(g)z3(s)||oo (4) 

Note that the a from Theorem [5] is indeed the same or better (i.e., bigger) than 
the a in this Theorem, because ymin = iq*u^s))iTiin > Qmin ^^nd (gs)min > q'^in- 

Rewriting (g]) in terms of /3, we have \\q*s - q's Woo < \Jjl3\\q*D(s) ~ (<?)d(s)I|oo- 

By inductive assumption, ||(7^(-5^ ~ to(s)||oo 

< I3^^'^6^ , and thus \\q*g - 
q'sWoo < ^i/3''s^2i-/s < iphsg2-fs _ rpj^^g gj^^^ ^j^g^^ ^j^g inductive hypothesis 
holds also for SCC S, it suffices to show that for the approximation qs we 
have - gs||oo < But /3 > 1, /is > 1, 2-^^ < 1 and 5 < 1, so 

|(5 < ^P'^^S^ , so it suffices to show that ||(?5 — (?s||oo < 1 5. Part 2 of Theorem 
[2]tells us that we will have H^g— 9s||cx3 < if g > h—1 and h > 2-|-nlog i+log |. 
But we have already established this in equation ([3]), hence the claim follows. 

□ 

Next, we want to generalize Theorem[3]to arbitrary MPSs that have an LFP, 
g* > 0, without the restriction that < g* < 1. The next Lemma allows us to 
establish this by a suitable "rescaling" of any MPS which has an LFP g* > 0. If 
X = P{x) is a MPS and c > 0, we can consider the MPS x — ^P{cx). 

Lemma 1. Let x — P{x) be a MPS with LFP solution q* , and with Jacobian 
B{x), and recall that for z > {), Afp{z) := z + (/ — B{z))^^(P{z) — z) denotes 
the Newton operator applied at z on x — P{x). Then: 

(i) The LFP solution of x = ^P{cx) is ^q* ■ 
(ii) The Jacobian of ^P{cx) is B{cx). 
(Hi) A Newton iteration of the "rescaled" MPS, x 
z is given by ^Afp{cz). 

Proof. From pLlj, we know that the value iteration sequence P(0), P{P{0)), 
P(P(F(0))) . . . P'=(0) converges to q*. Now note that for the MPS x = ^Picx), 
the value iteration sequence is ^/'(O), ip(cip(0)) = iP(P(0)), iP(P(P(0)))... 
which thus converges to ^q* . This establishes (i). 

For (ii), note that, by the chain rule in multivariate calculus (see, e.g., [2] 
Section 12.10), the Jacobian of P{cx) is cB{cx). Now (iii) follows because: 

z+{J-B{cz))-\-P{cz)-z) = -{cz + {I-B{cz))-^{P{cz)~cz)) = -Np{cz). 

c c c 



— ^P{cx), applied to the vector 



□ 



We use Lemma [T] to generalise Theorem |3] to MPSs with LFP q* , where q* 
does not satisfy q* < 1. 

Theorem 4. If x — Q{x) is an MPS with n variables, with LFP solution q* > 0, 
*/ ''min least positive coefficient of any monomial in Q{x), then R-DNM 

with rounding parameter h' , and using g' iterations per nonlinear SCC (and one 
for linear), gives an approximation q such that \\q* — q\\oo < e'? where 

g'^2+\2f- (log(i) + d-{2u + log(a'-(4"+i)) + log(16n) + log(||Q(l)||eo)) ) 1 
e' 

and h' = g' + 1 — u, where u = max{0, [log gj^^^^] }, d is the maximum depth 
of SCCs in the DAG Hq of SCCs of x — Q{x), f is the nonlinear depth, and 
a' = 2-2" min{l,c;„i„}min{l, ^q^^^J. 

We can again obtain worst-case expressions for the needed number of itera- 
tions g' , and the needed rounding parameter h' , in terms of only f < d < n < \Q\, 
and e', by noting that log(||(5(l)||oo) < |Q| and by appealing to Theorem[T]to 
remove references to 5*^;^^ and gj^^x the bounds. Substituting and simplifying 
we get that to guarantee additive error at most e', i.e. for i = log(l/e') bits 
of precision, it suffices in the worst-case to apply g' = kg + cq ■ i iterations of 
R-DNM with rounding parameter h' — g' + 1 (which is more accurate rounding 
than h' = g' + 1- u), where cq = 2-^, and kq = 0(2^ 5'^n'^d{\Q\ +n\ogn)) (and 
we can calculate precise, tame, constants for the big-0 expression). 

Corollary 2. If x ^ P{x) is an MPS with LFP solution q* with < q^^^ < 
Qi ^ 'Jmax "^^^ h with the least coefficient of any monomial in P{x), Cmin; with 
f the nonlinear depth of the DAG of SCCs of x = P{x) and with encoding size 
\P\ bits, we can compute an approximation q to q* with \\q* — q\\oo < for any 
given < e < 1, in time polynomial in \P\,2-I' , log j,log and logq*^^^^. 

Proof. After preprocessing to remove all variables Xi with q* = 0, which takes 
P-time in we use R-DNM as specified in Theorem |4l Calculating a Newton 
iterate at z is just a matter of solving a matrix equation and if the coordinates 
of z are multiples of 2~^ this can be done in time polynomial in |P| and h. 
Theorem |4] tells us that the number of iterations and h are polynomial in 2^ , 
log i, log logq*^axj '^j log ^"^^ ll^'(l)lloo- The last three of these are 
bounded by \P\. Together, these give the corollary. □ 

5 MPSs and Probabilistic 1-Counter Automata 

A probabilistic 1-counter automaton (plCA), M , is a 3-tuple M = (V, 5, 5q) 
where is a finite set of control states and 5 QV x R>o x { — 1, 0, 1} x F and 
X M>o X {0, 1} X are transition relations. The transition relation 5 is 
enabled when the counter is nonzero, and the transition relation 5q is enabled 
when it is zero. For example, a transition of the form, {u,p,—l,v) G S, says 
that if the counter value is positive, and we are currently in control state u, 



then with probabihty p we move in the next step to control state v and we 
decrement the counter by 1. A plCA defines in the obvious way an underly- 
ing countably infinite-state (labeled) Markov chain, whose set of configurations 
(states) are pairs {v,n) € V x N. A run (or trajectory, or sample path), starting 
at initial state {vo,no) is defined in the usual way, as a sequence of configura- 
tions {vq, uq), (vi, rii), (u2, ^2)1 • ■ • that is consistent with the transition relations 
of M. 

As explained in [TU], plCAs are in a precise sense equivalent to discrete-time 
quasi-birth- death processes (QBDs), and to 1-box recursive Markov chains. 

Quantities that play a central role for the analysis of QBDs and plCAs (both 
for transient analyses and steady-state analyses, as well as for model checking) 
are their termination probabilities (also known as their G-matrix in the QBD 
literature, see, e.g., [51 fTU]). These are defined as the probabilities, g,*^, of 
hitting counter value for the first time in control state v £ V , when starting 
in configuration (u, 1). 

Corresponding to the termination probabilities of every QBD or plCA is a 
special kind of MPS, x — P{x), whose LFP solution q* gives the termination 
probabilities of the plCA. The MPSs corresponding to plCAs have the following 
special structure. For each pair of control states u,v €V oi the plCA, there is 
a variable Xuv ■ The equation for each variable Xuv has the following form: 

Xuv = Puv ^ ^" ( ^ ^ Puiii'^u'i' ) "I" ^ ^ p\iy ^ ^ XyzXzv (5) 

where for all states u,v G V, and j G { — 1,0,1}, the coefficients puv are non- 
negative transition probabilities of the plCA, and such that for all states u G V, 

SjG{-i 1} '^vevP^'" — 1- '^^^ °f course clean up this MPS in P-time (by 
Proposition!!]) , to remove all variables Xuv for which qu^ — 0- In what follows, we 
assume this has been done, and thus that for the remaining variables < q* < 1. 

In [TU], the decomposed Newton's method (DNM) is used with exact arith- 
metic in order to approximate the LFP for plCAs using polynomially many 
arithmetic operations, i.e., in polynomial time in the unit-cost arithmetic model 
of computation. However [10] did not establish any result about the rounded 
down version of DNM, and thus no results on the time required in the stan- 
dard Turing model of computation. We establish instead results about R-DNM 
applied to the MPSs arising from plCAs, in order to turn this method into a 
P-time algorithm in the standard model of computation. 

It was shown in [10] that in any path through the DAG of SCCs of the 
dependency graph for the MPS associated with a plCA, M, there is at most one 
non-linear SCC, i.e. the nonlinear depth is < 1. Also, [TU] obtained a lower bound 
on the smallest positive termination probability. Namely, if Cmin denotes 

the smallest positive transition probability of a plCA, M, and thus also the 
smallest positive constant or coefficient of any monomial in the corresponding 
MPS, X = P(x), they showed: 



Lemma 2. ( Corollary 6 from 110]) q*^i^ > cj^iin? where r is the number of control 
states of the plC'A. 

They used these results to bouud the condition number of the Jacobian matrix 
for each of the hnear SCCs, and to thereby show that one can approximate q* in 
polynomially many arithmetic operations using decomposed Newton's method. 
Here, we get a stronger result, placing the problem of computing termination 
probabilities for plCA in P-time in the standard Turing model, using the results 
from this paper: 

Theorem 5. Let x = P{x) be the MPS associated with plCA, M, let r denote 
the number of control states of M , and let m denote the maximum number of 
bits required to represent the numerator and denominator of any positive rational 
transition probability in M . 

Apply R-DNM, including rounding down linear SCCs, to the MPS x — P{x), 
using rounding parameter h :— ^mr"^ + 2mr^ + 9r^ + 3 + [2 log i] and such that 
for each non-linear SCC we perform g = h—l iterations, whereas for each linear 
sec we only perform 1 R-NM iteration. 

This algorithm computes an approximation q to q* , such that \\q* ~q\\oo < £• 
The algorithm runs in time polynomial in \M\ and log ^, in the standard Turing 
model of computation. 

This follows from Theorem [H using the fact that log(l/g*ji,j) is polynomially 
bounded by Lemma [21 and the fact that the nonlinear depth of the MPS x = 
P{x) for any plCA is / < 1 ([IHI)- The detailed proof is in the appendix. 

5.1 Application to a;-regular model checking for plCAs 

Since computing termination probabilities of plCAs (equivalently, the G-matrix 
of QBDs) plays such a central role in other analyses (see, e.g., [HI [3l [TOl H]), 
the P-time algorithm given in the previous section for computing termination 
probabilities of a plCA (within arbitrary desired precision) directly facilitates 
P-time algorithms for various other important problems. 

Here we highlight just one of these applications: a P-time algorithm in the 
Turing model of computation for model checking a plCA with respect to any 
w-regular property. An analogous result was established by Brazdil, Kiefer, and 
Kucera [4J in the unit-cost RAM model of computation. 

Theorem 6. Given a plCA, M, with states labeled from an alphabet S, and 
with a specified initial control state v, and given an uj-regular property L{B) C 
S'^ , which is specified by a non- deterministic Biichi automaton, B, let PrM{L{B)) 
denote the probability that a run of M starting at configuration (w,0) generates 
an Lo-word in L{B). There is an algorithm that, for any e > 0, computes an 
additive e- approximation, p>0, of Pr]\[{L{B)), i.e., with \PrM{L{B)) — p\ < e. 
The algorithm runs in time polynomial in \AI\, log^, and 2'^', in the standard 
Turing model of computation. 
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A Background Lemmas, and Missing Proofs 

We first recall some Lemmas from [^jlll 

Lemma 3. (Lemma 3.3 of I9j) Let x — P{x) he a quadratic MPS, with n vari- 
ables, and let a,b € M". Then: 

„, ^ „,ci + b^, B(a) + B(b) , 

P{a) - P{b) = B{^){a -b)= ' ' ^ ' {a - b) 

Lemma 4. Let x — P(x) be a quadratic MPS. Let z G M" be any vector such 
that (/ — B{z)) is non-singular, and thus f\fp(z) is defined. Then: 

q ~ Np{z) ^ {I - B{z)) ^ [q - z) 

We will also need the following lemma from 

Lemma 5. (Lemma 5.4 from f^. Lemma 3.7 from JU]) Let x = P{x) be a MPS, 
with polynomials of degree bounded by 2, with LFP, q* > 0. Let B(x) denote 
the Jacobian matrix of P{x). For any positive vector d G M"q that satisfies 
B{q*)d < d, any positive real value A > 0, and any nonnegative vector z € K^O' 
if q* ^ z < Xd, and (/ — B{z))~^ exists and is nonnegative, then 

q*-Up{z) < ^d 

We also need to recall a number of basic facts from matrix analysis and 
Perron-Frobenius theory. For a square matrix A, let p{A) denote the spectral 
radius of A. Recall that a nonnegative square matrix A is called irreducible if its 
underlying directed graph is strongly connected, where the adjacency matrix of 
its underlying directed graph is obtained by setting the positive entries of the 
matrix ^ to 1. 

Lemma 6. (see, e.g., ]15^ . Theorem 8.4.4) U ^ irreducible nonnegative 

square matrix, then there is a positive eigenvector v > 0, such that Av — p{A)v. 
Such a vector v is called the Perron vector of A. It is unique up to resettling by 
a positive factor. 

Lemma 7. (see, e.g., flTf . Theorem 15.4-1 and Exercise 1, page 540) If A is 
an irreducible nonnegative square matrix and Q <^ B < A, but B ^ A, then 
p{B) < p{A). 

Lemma 8. (see, e.g., 1171, Theorem 15.2.2, page 531) If A is a square matrix 
with p{A) < \, then I — A is non-singular and {I — A)^^ = X^i^o 

In [5], the statements of Lemma 3.3 and 3.4 assume that the MPS is in SNF form, 
but as noted in [^, the proofs of Lemma 3.3 and 3.4 do not require that x = P{x) 
is in SNF form, nor that it is an MPS, only that it is quadratic. 



Lemma 9. (see, e.g., 1171, Section 15.3 and Exercise 11) If A is an irreducible 
nonnegative square matrix, and v > is a positive eigenvector associated with 
some eigenvalue r, i.e., such that Av — rv, then r = p{A). Thus v > is the 
Perron vector (which is unique up to scaling). 

A. 1 Proof of Theorem H 

Theorem [2l Let P{x, y) be an n-vector of monotone polynomials with degree 
< 2 in variables which are coordinates of the n-vector x and the m-vector y, 
where n > 1 and m > 1. 

Given non-negative m-vectors yi and y2 such that < yi < 1 o,nd < 2/2 £ 
yi, let Pi{x) = P{x,yi) and P2{x) = P{x,y2). Suppose that x = Pi{x) is a 
strongly- connected MPS with LFP solution < < 1- 

Let a = min{l, Cinin}niin{7;min, ■^q^i^}' "where Cmin "is the smallest non-zero con- 
stant or coefficient of any monomial in P(x,y), where j/,nin is the minimum 
coordinate of yi , and finally where q*^i^ is the minimum coordinate of q^ . Then: 

1. The LFP solution of the MPS x = ^2(2^) is with < 92 — '^'^^ 

hi - qUoo < ^4na-(3«+i)||P(l, l)\\^\\y, - ysHoo (6) 
Furthermore, if x — Pi (x) is a linear system, then: 

H - 92II00 < 2na-("+2)||P(l, l)\U\yi - yslloo (7) 

2. Moreover, for every < e < 1, if we use g > h — 1 iterations of rounded 
down Newton's method with parameter 

h> [2 + nlog- +logi] 
a e 

applied to the MPS, x — P2{x), starting at xl'^l 0, to approximate q2, then 
the iterations are all defined, and \\q2 — a;'^'||oo < £• 

Proof. We first establish 1. Since x — Pi{x) is a strongly connected system 
of equations, and ql > 0, this implies that matrix Bi(ql) is non-negative and 
irreducible, where Bi{x) is the Jacobian matrix of Pi{x). 

Thus, by Lemma [6l there is a positive Perron eigenvector t; > of Bi{ql), 
which satisfies Bi{ql)v — p{Bi{ql))v. We can always scale v such that ||w||oo ~ 1- 

We will observe that Bi{ql)v < v, and that if we scale v so that ||w||oo = 1 
then the smallest coordinate of v, denoted Wmin, has Vmin ^ Q^"- 

Lemma 10. (This is a variant of Lemma 6.5 from J 11^) For any strongly- 
connected MPS, X = P{x), with LFP q* > 0, and Jacobian B{x), we have 
p{B(q*)) < 1, and for all vectors y with <y < q* , p{B{y)) < 1. 



Proof. We will only show here that p{B{q*)) < 1 if a; = P{x) is strongly con- 
nected, but in fact this holds for any MPS, x = P{x), with LFP q* > 0. We do 
so because we will only use the strongly-connected case. 

If we have < z < y and z < P{z), then Lemma 6.4 of [111 shows that for 
any d > 1, B'^{z)(y - z) < P'^{y) - P\z). Let = P'(0), for alH > 1. Recah 
that limi^oo a;* — Q* ■ Also note that, because x — P{x) is strongly connected, 
< q* for all i. 

Then for all i,d> I, B'^{x'){q* ~ x') < P'^{q*) - P'^{x') = q* - x'+'^. But 
since lim^-^oo a:^*"'''^ = q* , we see that the right hand side goes to 0. But since 
(g* — X*) > for all i, it must be the case that — )■ 0, as d goes to infinity. 
But this is a necessary and sufHcient condition for p{B{x^)) < 1. Now notice 
that for any vector y such that < y < q*, there is some i such that y < x'^. 
Thus, by monotonicity of p{B{x)) in x > 0, we must have p{B{y)) < 1. 

Thus, also, since limi^.^x'^ — q* , and by continuity of the spectral radius 
function, we get that p{B{q*)) < 1. □ 

Corollary 3. p{Bi{q\)) < 1, and thus if v is a Perron vector of Bi{q*) then 
B^{ql)v^p{B^{ql))v<v. 

The following basic lemma, applied to Bi{q*) and its normalized Perron vector 
!>, yields the desired result about v: 

Lemma 11. If A is a irreducible, non-negative n x n matrix with minimum 
non-zero entry flmin; o-nd u > is a non-zero vector in R" with Au < u, then 
Omin < 1 o,nd if the minimum and maximum coordinates of u are denoted Umin 
and Wmax; respectively, then we have > a^j^^. In particular u > 0. 

Proof. Let z,j be some coordinates with Ui = Mmin and Uj — Umax- Because A 
is irreducible and non- negative, there is a power < /c < n with {A^)ij > 0. 
By matrix multiplication, for any A: > 1, {A'^)ij — ^ Yii -^nM+n where the sum 
is taken over all length A: -I- 1 sequences of indices ii,. . . ,ifc+i, with ii = i and 
ik+i = j, and with i/ e {1, . . . , n} for alH ranging from 1 to k. At least one of 
these products is non-zero and thus it is at least a^^ij,. That is {A'^)ij > a^^^. 
Since Au < u, and A is non-negative, a simple induction gives that A'^u < u. 
And since u is non-zero, Umax — Uj > 0, so < A^^uj < Ui. Since Ui — Uminj 
this means u > 0. Also, 1 > -^i™ = ^ > A'^, > a'^-. Note that since 1 > a'^-, 

' — Umax Uj — — mm — mm' 

this implies Cmin < 1- We know that 1 < fc < n, so a^;,-, > a^^j^. □ 

Applying Lemma [TT] to A — i?i(q*) and v the Perron vector of Bi{ql), 
normalized so that i^max = Ij and observing that the smallest non-zero entry of 
Bi{ql) is at least a, we get: 

Corollary 4. If v is the Perron vector of Bi{ql), normalized so that Wmax = 1; 
then = Umin > a" • 

Next, to show that < < ih we consider Pf{0) = Pi(Pi(...Pi(0)...)), 
i.e., the /c'th iterate of Pi applied to the vector 0, and -P2 (0)- We know that for 



any MPS, x = P{x) with LFP q* e K^g, we have hnifc^oo -P'' (0) = q* (jTT]). 
Thanks to the monotonicity of P, for any a; > 0, we have Pi{x) > P2{x). By the 
monotonicity of Pi and an easy induction, (0) > P2°(0)- So g]* > 92- 

Next we want to obtain the bounds © and ([7]) on \\q\ — 92 1 loo- 9i = 92 5 
then we are triviaUy done so we assume that 7^ 9i- Because x — Pi{x) is at 
most quadratic, we can apply Lemma |3] to get: 

Si ( ^ (91* + 92 ) ) (9i* - 92 ) - Pi ) - Pi (92* ) = 9i* - Pi (92 ) (8) 

Multiplying both sides of equation ([8]) by —1, and then adding {ql — (72) to both 
sides, we get: 



{I - Si(i(9i* + 92*)))(9i* - 92) = (9i* - 92) - (9i* - ^^1(92)) 

= ^1(92) - 92 

^ PM) - P2iq*2) (9) 

Provided that {I - Bi{^{ql + q'^)) is non-singular, we can multiply both sides 
of equation © by (/ — Bi{^{ql + 92))^^, to get 

ql ql = {I- i?i(i(gr + 92))"'(A(92*) - ^2(92)) (10) 



We will be taking the ||.||oo norm of equation ([T0|) to obtain the bound we 
need for 1192—9^1100- To do this we first need to bound ||(/ — ^1(^(9^ +92))^^||oo, 
and in particular we need to show that (/ — Bi{^(ci[ + 92))"^ is nonsingular. 

By m we have <zt- Pi (92*) = Bi(^(9i*+92*))(9i*-92*)- Now Pi (^2*) > P2(92*) = 
92*. Thus ql - 92* >ql- Pi (92*)- So Pi(i(9i* + 92*))(9i* - 92*) < (9i* - 93)- Since 
each polynomial in P(x,y) has degree no more than 2, each entry of Bi{x) is a 
polynomial of degree no more than 1 in both x and in the entries of yi when these 
are treated as variables. In other words, each entry of Bi{x) can be expressed 
in the form CiXi) + c'^yj) + c", where c^-, and c" are all non-negative 
coefficients and constants of P{x, y) (possibly multiplied by 2 in the case where 
the term of P{x,y) they originate from is of the form cx^) for all indices i and 
j. So for any i,j Bi{^ql)ij > ^Bi{ql)ij. Also, since 92 < 9i and ql > 0, we have 
-Si(|(9i + 92)) > Bi{^ql), and the matrices Bi{^{ql + q^)) and Bi{^ql) are 
both irreducible. Also, both these matrices have non-zero entries > a, because 
the coefficients a, c'j, and c" are all > Cmin, and the entries of ^ql and ^{ql -I-9I) 
are both > ^q^i^- Now, Lemma [TTl apphed to matrix A — Bi{^{ql + 93)) and 
vector u = (9i — 92)1 yields that 

I'^lzf,"'''' > (11) 

(9l 92)max 

In particular, we have thus also shown that if 92 ql then: 



92 < 9i 



(12) 



Now suppose that Bi{x) is not independent of x. Since < Qi^ there is some 
entry of Bi{^{ql + q2)), say Bi{^{ql +q2))ij, which is strictly smaller than that 
of Bi{ql)ij. The entry Bi{x)ij must be of the form {J^i Ci^i) + ^jVj) + c", 
where for some fc, > so that the term CkXk depends on Xk- We must therefore 
have Bi{ql)ij - {B{^{ql+q2))ij > c^in^iql ~ q2)k, for some indices z, j, k. From 
inequality (fTTl) we know that > a", for all indices k. Thus, since 

ill - 92)max = \\qt - g2l|oo, WC havC 

BliqDij - Bi{^{ql + ql))^j > Cmin^(gJ - 92)/= 

>«"+'^ll'7i* -921100 (13) 

Since ^2 < ^ili + I2) < 1i- This combined with Lemma [TO] together 
imply that p{Bi{^{ql + q^))) < 1, and thus that (/ - Bi{^{ql + 92)))^^ exists 
and that [I - Bi{^{ql + q;)))-^ = J^Zo Bii^ill + 12)))" > 0- Now we need the 
following result from |10j : 

Lemma 12. (Lemma 18 from fW\/) Let A e M"^" ^ ^ l^>o s'^c/i i/iai; 
(/ — A)^^ — X^fcLo ' (-^ — < 1, and A is an irreducible nonnegative 

matrix whose smallest nonzero entry is c > 0, and b =/= and p > is the largest 
entry of b. Then \\(L-A)-^^<^. 

We wiU take A = Bi{^{ql + 92)) and b = (L - Bi{^{ql + 92)))^ in this Lemma 
(recall that v is the normalized Perron vector of Bi[ql), such that Wniax = !)• 
We know that (/ - Bi{ql))v > 0. So 

b>iBM)-B,i^iql+q;)))v>0 (14) 

Inequality p3p gives us a lower bound for a single entry of the non-negative ma- 
trix (Si(gi*)-Bi(i(gi*+g2*))), namely the (iJYth entry. In {B,{ql)- Bi{l{ql + 
q2)))v this (i,J)'th entry is multiplied by a coordinate of v, which is at least 
t;min- Thus, combining inequalities ([TO]) and ([TO]), we have ||&||oo > ~ 
92llooWmin- From CoroUary |4] we have that Vn^in > a". So 6 > and ||fo||oo > 
a2"+ii||q* - q*\\^. Now, by definition, {I - Bi{^{ql + q^)))-^b ^ v < 1. Since 
the smallest non-zero entry of ^ = Bi{^{ql + 92)) is at least a, and since 
ll^llcxj > a^^^^^llg* — 92II00, Lemma ITOl now gives that 

ii(/-B.(^M + .,;)))-iu< „„.,.iJ;'^^.|i^ (15) 

Next suppose that Bi{x) is independent of x (i.e., Pi{x) consists of linear or 
constant polynomials in a;). We can thus write it as a constant, irreducible 
Jacobian matrix of Pi(x), where the MPS x = Pi{x) has an LFP ql > 0. 
It must therefore be the case that p{Bi) < 1, because we already know from 



Lemma [TU] that for all z such that < z < qj", we have p[B\(z)) < 1, but Bi{z) 
is independent of z, because Bi is a constant matrix. 

Let us apply Lemma 3.3 of [9], i.e., Lemma [3] above, with a — ql, 6 = 0, 
and Pi{x) in place of P{x). We get [Bi) ■ (g* - 0) = Pi(g^) - P(0). Multiplying 
both sides of this equation by —1 and then adding ql to both sides, we get 
{I-Bi)ql = Fi(0), and thus ql = (/ - Bi)~iPi(0). Since ql > 0, we must have 
that Pi(0) ^ 0. But Pi(0) > 0. Indeed, \\PiiO)\\^ > c^in niin{l, > a^. The 

smallest non-zero entry of Bi is at least c,„in • min{l,?/min} > a. We now apply 
Lemma [T^ to A := Bi and b := Pi(0), where we note that (/ - Bi)-^Pi{0) ^ 
ql <1. Lemma [T^ thus gives: 

lia-Si(^(9i*+92)))"'lloo <na-("+2) (16) 

Since \\ql — q2\\oo < 1 (9i < 1 and > 0), and < a < 1, and since n > 1, 
the upper bound (jlSp for the non-linear case is worse than the upper bound (jl6p 
for the linear case, so the upper bound holds in all cases. 

We have shown that (/ — Bi{^{ql + g^))) is non-singular, since p{Bi{^{ql + 
Q2))) < 1- Equation (jTU]) is thus valid, and taking norms of (jTU]) yields: 

lki*-g2lloo< ||(/-Pi(i(gi* + g2)))"'lloo||Pi(<Z2)-^2(g2)IU (17) 

Inserting our upper bound (|15l) for || (-/^ — Pi(-^('?i + 'Zl)))^^!!':© gives: 

2?? 

Ikl* - 4\U < 11^1 (g2) - P2(g2*)l|oo 

« l|yi '/2II00 

We now move the \\ql — 92 1 1 00 terms to the left and take square roots to obtain: 



Iki* - 92 II 00 < v'2na-(3"+i)||Pi(g2*)-P2(g2*)||oo (18) 



Lemma 13. If 0<x<l, then \\Pi{x) - P2{x)\\^ < 2\\P{1, l)\\oo\\yi - y2\U- 

Proof. Since each entry of P{x, y) is a quadratic polynomial, for each b G {1,2} 
and each d G {1, . . . , n}, the d'th coordinate, {Pb{x))d, of Pb{x) = P{x, yt) has 
the form 



where ybj refers to the j'th coordinate of the m- vector yb, and where all the 
coefHcients ad,i,j, Cd,ij, '^'dij^ '^'dk ^'^'^ ^'d ^ ^''^ non-negative. Also, recall < 
2/1 < 1 and < y2 < yi- Thus, 

\\Pl{x) ~ P2{x)\\oo 

= max^ Cd,jj(yi,j?/ij - y2,iy2,j) + ^ Q , ,,x,(yij - y2j) + ^ ^AyiM - 2/2,fc) 

i,j i,j k 

- n^a^Xl ~ ^2.*) + (yi,i - y2,j)) + c'd^tjiyij - 2/2j) + Y '^'d,k{y^M 

i,j i,j k 

- ™-ax^2 • cd,^,J -Wyi- 2/2II00 + • - y2\\<x. + J2^d,k\\yi - y^Woc 

hj hj k 

= (max^ 2cd,,j + c'^^, + c'l,^) ■ \\yi - 2/2II00 
<2||P(l,l)|U||yi-y2||oo 

□ 

Combining (fT8)) and Lemma [131 we have. 



lki*-92*lloo < •\/47^a~(3"+i)||F(l,l)||oo||2/2-yi||oo 

which completes the proof of the first inequality of part (1.) of Theorem [2l 
We show the second inequality ([7]) of part (1.) in the next lemma. 

Lemma 14. If Bi{x) is a constant matrix, i.e. x — Pi{x) is linear, 
hi - 92*1100 < 2na-("+2)||P(l, l)||oo||yi - y2||oo 



Proof. If q*i = (72, then the result is trivial. So we assume that ^ In the 
proof of the first inequality, under the assumption that B\(x) is a constant, we 
obtained equation (jl6p . So, under the assumptions of this Lemma, equation (I16p 
is valid, and we can substitute the bound p6|) into the equation (jl7[) instead. 
This gives 

M - 92II00 < na-("+2)|lPi(q*) - P2{ql)\U 
Again, Lemma [T3l gives a bound on ||Pi(q2) ~ ^2('?2)lloo- Substituting this gives: 

\Wl - g2*lloo < na-("+2)2|lP(l, l)||^||yi - y2||oo 

□ 



We will next establish part (2.) of Theorem[2j Let us first prove that, starting 
from x'"] :— 0, all the iterations of R-NM, applied to a; = ^2(2;) are defined. 



We firstly note that if < < g* and p(B2(xW)) < 1, then 7Vp2(xW) 
is well-defined and < x^^+^^ < q^. If p(B2(x[''l)) < 1, then by Lemma |S1 
(/ — i?2(a;['^J)) is non-singular and so Afp^^x^''^) is well-defined. Lemma |S] also 
gives that (J - BaCxW))-^ = 'ZZa ^2(2;^)' > 0. Lemma H yields that: 

92 - A/P. (-['^1 ) = (/ - 52 (^W))-l ^^('^^*)'^^^("'") _ ^[^=1 ) 

Note that {q^ - xl*-'!) > 0, thus that 52(172) " B2ix^''^) > 0, and we have just 
shown that (/ — i?2(a;['^l))~^ > 0. So all the terms on the right of the above 
equation are non-negative, and thus 53 ~ ■^P2i^^''^) — 0- That is ^ A/pa (a;'*^' ). 
^.[fc+i] defined by rounding down Afp2{x^''^) and maintaining non-negativity, 

thus for all coordinates i, either x\ — 0, in which case trivially we have 
^[fc+i] ^ Q ^ g^gg Q ^ ^[fc+i] ^ _/^^j2.[fc]). < Thus xl'^+il < 92*- 

What is still missing is to show that p{B2{x^'^^^^)) < 1. If we can show this 
then by an easy induction, for all fc, A/p2(a;''°l) is well-defined and < x^''^ < q2- 
We will prove p{B2{x'^^~^^^)) < 1 by considering separately the cases where Pi{x) 
contains non- linear or only linear polynomials. 

Lemma 15. If x ^ Pi^) is a strongly- connected quadratic MPS with n vari- 
ables, with LFP q* > 0, and there is some non-linear quadratic term in some 
polynomial Pi{x), then if < z < q* , then J\fp{z) is defined and Hp {z) < q* . 

Proof. Lemma [TOl tells us that p{B{q*)) < 1. Non-linearity of P{x) means that 
B{x) does depend on x. That is, some entry of B{x) contains a term of the form 
cXi for some Xi with c > 0. So B{z) 7^ B{q*), and B{z) < B{q*) since B is 
monotone. Since x — P{x) is strongly-connected and q* > 0, Lemma [TOl vields 
that p{B{z)) < 1. By Lemma [51 (/ — B{z)) is non-singular and so the Newton 
iterate Afp{z) is well-defined. Consider the equation given by Lemma |4l 

- Mp{z) = il- - z) 

We know that q* - z > 0, and thus B{q*) - B{z) > 0. Since p{B{z)) < 1, 
by Lemma [51 (/ — B{z))^^ = '^'kLoB{z)'^ > 0. This and Lemma [31 is already 
enough to yield that q* — Afp{z) > 0, and we just need to show that this is a 
strict inequality. 

We first show that if Pi{x) contains a term of degree 2, then ( -^^^-^^-^^ (g* — 
z))i > 0. This term of degree 2 must be of the form cXjXk for some j,k. Then 
B{x)ij has a term cxk with c > and so {B{q*) — B{z))ij > c{q* — z)k- But 
then (M_lM£)(g* _ ,))^ > c{q* - z)kiq* ~ z), > 0. 

Now we will show that for all i e {1, . . . ,n}, {q — Mp{z))i > 0. If Pi(x) 
contains a term of degree 2, then we have just shown that {^^^^^-^^-^^{q* — z))i > 

0. But - EZoBiz)" > I. So (q-Afpiz)), > {^^^{q* - 

z))i > 0. If Pi{x) does not contain a term of degree 2, there must be some other 
Xj with Pj{x) containing a term of degree 2 and, since x = P{x) is strongly- 
connected, Xi depends on Xj, possibly indirectly. That is, there is a sequence 



of variables io,ii,...,i; with I < n, iq — — j, and for each < m < l,Xi^ 
appears in a term of P{x)i^_-^. Let k be the the least integer such that P{x)i^ 
contains a term of degree 2. Then if < m < fc, appears in a degree 1 
term in P(x)i^_-^, that is one of the form CmXm with c„i > 0. So B{x)i^_-^_i^ 
contains the constant term c„i > 0. So B{z)i^_^,i^ > c„i > 0. So B'^{z)i^i^ > 
nJn=^o > rim^^o'^m > 0- Sincc P{x)i^ Contains a term of degree 2, 
from above (M^^l?* " ^))». > «■ So (^^(z) (g* - z))^ > 0. But 

g* - AAp(z) = (/- i3(z))-iM£)_M£) (g* _ z) = B™(z))M^(g* - 

z) > - ^). So {q*~Afp{z)), > for all as required. □ 

We will only actually need to apply Lemma [15] in the case when q2 = ql and 
X = Piix) is non-linear. 

Suppose that q^ = ^2 some polynomial in Pi{x) is non-linear in x. We 
claim that then Pi{x) = P2{x). That is, for all those variables in y, say {y)j, 
that actually appear in some polynomials in P{x,y), it must be the case that 
iyi)j — iy2)j- Otherwise, if there is some variable (y)j with (2/2)^ < {yi)j such 
that {y)j appears in then {P2{q*l))^ = {P{ql,y2))z < P{qt,yi))t = {ql)t, 

so ql is not a fixed point of P2 (x) , contradicting that ql =92- Thus if x = Pi (x) 
is non- linear and ql = ^2 then x = P2{x) is also non- linear and q2 — Qi > 0, so 
we can use Lemma [151 which shows that if < x^'^l < 92i then Afp.^{x^''^) < 
and so < < (72 < Qi- Since xl'^+^l < gif, we have p(i?i(a;[''+^l)) < 1. Since 

-B2(a:['=+il) < Bi(a;['=+il), we also have p{B2{x^''+^'i)) < 1. 

This leaves us with two cases remaining to show that all Newton iterates 
exist: first, the case where a; = Pi{x) is linear or constant, and second, the case 
where x — Pi{x) is non-linear and ^2 7^ Recall that it is sufficient to show 
that p{B2{x^''^)) < 1 for all iterates in order to show that all R-NM iterates exist. 
It thus suffices to show that in these cases for any < z < (72, p{B2{z)) < 1. 

For the first case, suppose that x = Pi{x) is linear. Then Bi{x) is a constant 
matrix. Thus -61(2:) — Bi{0) for all < z. But Lemma [TOl tells us that, since 
< ql, p{Bi{0)) < 1. Thus p{Bi{z)) < 1 for all < z < g*. Since < B2{z) < 
Bi{z), we have p{B2{z)) < 1 for all < z < 52- 

For the second case, suppose that gj 7^ ql and that x = Pi{x) is non- 
linear, and thus Bi{x) depends on x. Then we have previously argued that 
92 < it (see inequality ([l2])). But then Bi{q2) ^ Bi{ql). For any < z < ^21 
B2{z) < B2{q2) < Bi{ql) < Bi{ql) but because Bi{ql) ^ Bi{ql), we have 
i?2(z) 7^ Bi{q\). But Bi{q*) is irreducible, and Lemma [7| then tells us that 
p{B2{z)) < p{Bi{ql)). But we know, by Corollary [31 that p{Bi{ql) < 1. So 
p{B2{z)) < 1. 

Thus the R-NM iterations applied to a; = P2{x) are defined in all cases, and 
yield iterates < x^'^^ < q* , for all fc > 0. We can now prove the upper bound 
on the rate of convergence for R-NM applied to a; = P2{x). 

Lemma 16. Suppose an MPS, x — P{x), with n variables has LFP < < 1, 
and for some n-vector v > we have B{q*)v < v. Suppose we perform g > h—1 
iterations of R-NM with parameter h > 2 + [log J''"'"^^ ] on the MPS x = P{x), 



and suppose that for all k > 0, every iteration x^'^^ is defined and < x^'^l < q* . 
Then \\q* - a;M||oc < e. 

Proof. By induction on k, we claim that Vfc > 0, <?* - xl'^l < (2"'= + 2-''+'^):^v. 
Note that this would indeed yield the Lemma: for all fc < x''^' < q*. and 
the claim would yield q* - x^s] < (2-^+1 + 2-''+^):^v < 2"'°*^^Sl?^^w = 

It remains to prove by induction on A: > that q*—x^''^ < (2^'^ + 2^'*+^):j^u. 
This is true for fc = 0, because q* > — a;'"', and q* — = q* < 1 < -^j^v. 

LemmaOthen gives that q* - AAp(j:W) < {2~^''+'^'> + 2-'*);^w. Now, by 
definition of xl'^+^l, AAp(xW) - x^''+^^ < < 2-^-^v. So q* - xI'^+iJ < 

{2-~(k+i) j^2-^'+^):^v as required. Thus < 2-™2_i_„ < < 

□ 

To use Lemma [16] to get a bound on using R-NM on x = P2 (x) to compute 
note that because Q < B2{q2) < Bi{ql), the Perron vector w > of i?i(gj'), 

which satisfies Bi{ql)v < v, must also satisfy B2{q2)v < v. 

Thus, we just need to perform g > h — 1 iterations of R-NM on x = P2{x), 

with parameter h > 2 + log """"^ > 2 + loga~"e~^ in order to obtain that 

II 92 - x[^~^l lloo < £■ This completes the proof of Theorem El □ 

Corollary [TJ Let x — P{x) be a strongly connected MPS with n variables, and 
with LFP q* where Q < q* < 1. Let a = min{l, Cminj^'Zmin' where Cmin is the 
smallest non-zero constant or coefficient of any monomial in P{x). 

Then for allQ < e < \, if we use g > h—1 iterations of R-NM with parameter 
h > \2 + nlogi + log i] applied to the MPS, x = P{x), starting at x^°'^ := 0, 
then the iterations are all defined, and \\q* — xI^'Hoo < £• 

Proof. This is just a trivial application of Theorem [2l part 2., where we define y 
to be a dummy variable of dimension m — \, and we define 2/1=1/2 = J/min = 1, 
and where we define the n- vector of monotone polynomials /'(x, y), by replacing 
all constant terms c > in every polynomial in P(x) by cy. In this case, note that 
Pi(x) = P2{x) = P{x), and that since j/min = 1, the a defined in the statement 
of this corollary is the same a as in Theorem [2j □ 

A. 2 Proof of Theorem [4] 

Theorem |4l If x ~ Q{x) is an MPS with n variables, with LFP solution q* > 0, 
'^min least positive coefficient of any monomial in Q{x), then R-DNM 

with rounding parameter h' , and using g' iterations per nonlinear SCC (and one 
for linear), gives an approximation q such that \\q* — q\\oc ^ e'j where 

g' = 2+\2f- (log(i) +d-{2u + log(a'-(4n+i)) + iog(l6n) + log(||Q(l)||^)) ) 1 
e 

and h' ~ g' + 1 — u, where u — max{0, [logq*^^^^]}, d is the maximum depth of 
SCCs in the DAG Hq of SCCs of the dependency graph of x = Q{x), f is the 
nonlinear depth of Hq, and a' — 2^^" min{l, c^^j^} min{l, igj^jj^}. 



Proof. If (?*,ax ^ Ij then Theorem [3] gives this immediately. So we assume that 
9max > 1. w is chosen so that 2" > g^ax- We rescale and use Lemma [T] with 
scahng parameter c ~ 2". This yields the "rescaled" MPS x = 2^"Q(2"a;), which 
has LFP p* = 2-"g* < 1. 

So we can apply Theorem |3] to this rescaled MPS x = P{x), where P{x) = 
2~"Q(2"a;), and letting e :— 2~"e'. Then Theorem |3] gives us the needed number 
of iterations g and the rounding parameter h — g + 1, needed to obtain an 
approximation p of the LFP p* ~ 2^^q*, such that \\p — p*||oo < £• 

In the bounds specified for Theorem [3] for g and h, in place of gj^^jj^ we get 
pj^;^ = 2~"(7^;jj, and in place of Cmin we get 2~"cJ„i„. Thus a becomes the a' 
we have specified in the statement of this theorem. Furthermore, the ||P(l)||oo 
appearing in Theorem[3]is now ||2~"(5(2"l)||oo, but it is easy to verify that for 
a quadratic MPS, ||2-"Q(2"l)||oo < 2"||g(l)|joo. 

Theorem [3] tells us that if we use R-DNM on a; = P{x) for g iterations per 
nonlinear SCC and a precision of h — g + 1 bits, we will obtain an approximation 
p to the LFP p* of X = P{x) with \\p - _p*||oo < e provided that /i > [3 + 
2^ • (log(i) + d- (log(a-(4"+i)) + log(16n) + log(||P(l)||oo)))l- This condition is 
satisfied if we take g = g' and h ^ g' + 1 because: 

[3 + 2/ • (log(i) + d ■ (log(a-(4"+i)) + log(16n) + log(|lP(l)||oo)))l 

< 3 + 2/(log(^) + d(log(a'-(4«+i)) + log(16n) + log(2"||Q(l)||oo)))l 

= 3 + 2fiu + log(i) + d(log(a'-(4"+i)) + log(167^) + u + log(||Q(l)||oo)))l 
e' 

<g' + l^h 

Thus, applying R-DNM on a; = P{x) with parameters g = g' and h = g' + 1 
yields an approximation p to the LFP p* of x = P{x) with — p*||oo < e or, in 
terms of the original MPS, ||p - 2-"g*||oo < 2-"e'. 

To obtain Theorem SI we now show that if we apply R-DNM to a; = Q{x) 
with LFP q* , using rounding parameter h' and using g' iterations per nonlinear 
SCC (where h' and g' were specified in the statement of the Theorem), we will 
obtain an approximation q to q* that satisfies q = 2"p. This would then give us 
that \\q* ~ q\\oo = ||2>* - 2"p||oo = 2"||p* -_p||oo < 2"e = e', which is what we 
want to prove. 

Since we are using the decomposed Newton's method, we will show that qs = 
2"ps for every SCC S by induction on the depth of the SCC S. Suppose that for 
the variables D{S) that S depends on (if any), we have that q£i{S) — '^^Pd(s)- If 
we call the fcth iterate of R-NM applied to xs — Ps{xsiPd(S)) with parameter h, 
a;!*^! and the fcth iterate of R-NM applied to xs — Qs{xs, to(S)) with parameter 
h', x'^''\ then we aim to show by induction on k that a;'!''! — 2^x^^\ 

The base case is a;'[°l = = 2"x[°l. By abuse of notation, we will call the 
Newton iterate of a;^ = Ps{xs,Pd{S)): f^p{xs) and that of xs = Qsixs, to(s)), 
■f^qixs)- Note that because we assume that qD(S) — '^^Pd(S)i xs — Ps{xs,Pd{S)) 



is the result of scaling xs = Qsi^s, to(s)) using c — 2". So Lemma [T] (iii) yields 
that Afp{xs) = 2-"7VQ(2"a;s). If = 2"xW, then 7VQ(a;'W) = 2"A/'p(a;W). 

If (7Vp(a;W)), < 0, we would set a;f+^' 0. Uao,J\fQ{x'^% = 2"7Vp(a;W), < 
0, so we would set x'}''^^^ 0. 

If (A/'p(x['=]))i > 0, we set xf^^' to be the resuh of rounding {J\fp{x^''^))i 
down to a multiple of 2''. But then 7Vq(x'W) = 2''A/'p(a;W) > and we would 
set x'}''^^^ to be the result of rounding (AfQ{x'^^^))i down to a multiple of 2^'* . 
Note that h' — h — u. So the result of rounding 2"(A/'p(a;['^l))j down to a multiple 
of 2"'' is just 2" times the result of rounding {Afp{x^''^))i down to a multiple of 
2-^. So = 2"x['=+il. 

This completes the induction showing that 

^>[k] ^ 2"a;[''-l for all fc > 0. Note 
that g = g'. So qs = x'^^ 1 = 2"x[^l = 2^Ps- This in turn completes the induction 
on the SCCs, showing that q = 2"p, which completes the proof. □ 



A.3 Proof of Theorem [5] 

Theorem [5l Let x — P{x) he the MPS associated with plCA, M, let r denote 
the number of control states of M , and let m denote the maximum number of 
bits required to represent the numerator and denominator of any positive rational 
transition probability in M . 

Apply R-DNM, including rounding down linear SCCs, to the MPS x = P{x), 
using rounding parameter 

h := 8mr'^ + 2mr^ + Or^ + 3 + [2 log -'] 

e 

and such that for each non-linear SCC we perform g — h — 1 iterations, whereas 
for each linear SCC we only perform 1 R-NM iteration. 

This algorithm computes an approximation q to q* , such that \\q* — g||oo < £• 
The algorithm runs in time polynomial in \M\ and log i, in the standard Turing 
model of computation. 

Proof. We apply Theorem [3J which tells us that R-DNM with parameter 



h > 



-2f ■{ log(i) + d ■ (log(a-(4"+i)) + log(16n) + log(||P(l)||^)) ) 



(19) 

using g = h — 1 iterations for every SCC, gives an approximation q to q* with 
q < q* and such that ||q* — ^||oo < £• Here / < 1 since there is at most 1 non-linear 
SCC in any path through the dependancy graph. Furthermore, n — r"^ since the 
variables in x are indexed by two states x^v Also, d < n, and so d < r^. Also, 
Cmin > 2-™ and so by Lemma[21 > 2-""'". So a > 2-('"'-'+i). To show that 

||-P(1)IU < r, by equation jS]), P{l)uv ^ pL^^ + iJ2wev P^°-^) + J2yev P^^y'^ ^ 
Plugging all this into equation ([TH), we get: h > [3 -I- 2 • (log(i) -I- • {{Ar^ + 
l)(mr^ + 1) + log(16r^) -I- logr] . Noting that log(16r^) -f logr = log(16r^), and 



noting that when r > 1, log(16r'^) < 4r, we have: 

h>3 + Smr'^ + 2mr^ + Or"^ + [2 • log(-)] 

Note that the rounding parameter h and the number of iterations g = h — 1 
are both polynomials in the encoding size of the plCA, and in logi. Thus 
each iteration of R-DNM can be computed in polynomial time, and we only do 
polynomially many iterations. Thus the entire computation of q can be carried 
out in P-time in the Turing model of computation. □ 

A. 4 Proof of Theorem [6] 

Theorem[6l Given aplCA, M, with states labeled from an alphabet S, and with 
a specified initial control state v, and given an uj-regular property L{B) C S'^ , 
which is specified by a non- deterministic Biichi automaton, B, let PrM{L{B)) 
denote the probability that a run of M starting at configuration (t',0) generates 
an Lj-word in L{B). There is an algorithm that, for any e > 0, computes an 
additive t- approximation, p > 0, of PrM{L{B)), i.e., with \PrM{L{B)) — p\ < e. 
The algorithm runs in time polynomial in \M\, log-, and 2l^l, in the standard 
Turing model of computation. 

Proof (sketch). By Theorem [SJ we know we can compute termination probabili- 
ties q* for a plCA, M, with additive error e > in time polynomial in and 
logi 

Let us first observe that if we do not insist on having the w-regular prop- 
erty specified by a non-deterministic Biichi automaton B, and instead assume 
it is specified by a deterministic Rabin automaton R, then the analogous theo- 
rem follows immediately as a corollary of Theorem O and results established by 
Brazdil, Kiefer, and Kucera in [4]. Specifically, in [4] it was shown that, given 
a plCA, M, and a deterministic Rabin automaton, R, and given e > 0, there 
is an algorithm that, firstly, decides in P-time whether Pm{L{R)) > 0, and if 
so computes a value p which approximates Pm{L(R)) with relative error e > 0, 
i.e., such that \Pm{L{R)) — p\/ Pm{L{R)) < e, and the algorithm runs in time 
polynomial in |M|, and log-^, in the unit-cost RAM model of computation. 

The first observation we make is that, the results in [TU] and [3] together 
imply that for pfCAs there is no substantial difference in complexity between 
relative and absolute approximation, because the probabilities Pm{L{R)) can be 
bounded away from zero by l/2''°'^(l*^l'l-'^l) if it is not equal to zero (which can 
be detected in P-time). Thus, computing Pm{L{R)) with given relative error 
e > is P-time equivalent to computing Pm{L{R)) with e absolute error. 

Secondly, a close inspection of shows that the only use made in their 
entire paper of the unit-cost RAM model of computation is for the purpose of 
computing termination probabilities for plCAs, and specifically because they 
directly invoke the earlier result from [TD] which showed that termination proba- 
bilities q* for a plCA can be e-approximated in polynomial time in the unit-cost 
RAM model. Thus, the only thing needed in order to obtain an absolute error 



e-approximatioii of P]\i{L(R)) in P-time in the standard Turing model of com- 
putation is to appeal instead to Theorem [S] of this paper for computation of 
termination probabilities in P-time in the standard Turing model, and apply the 
rest of the construction in [4J. 

Next, let's first note that we can of course use Safra's construction to convert 
any non-deterministic Biichi automaton S to a deterministic Rabin automaton 
of size 2'^'l^l'°8l^". So, obtaining a complexity bound that is polynomial in 
2|B|log|B| difficuh. 

Let us now very briefly sketch why one can in fact obtain the (slightly) bet- 
ter complexity bound, polynomial in 21^', by combining prior results regarding 
model checking of RMCs \22i with Theorem [5] and Lemma [2l and with the key 
result by Brazdil, et. al. in [3], which establishes that non-zero non-termination 
probabilities for a plCA are also bounded away from zero by l/2^°'^'^l^^l^. 

As shown in fS', , for probabilistic model checking a naive subset construc- 
tion can be used (instead of Safra's construction) to obtain from a BA, B, a 
deterministic Biichi automaton, D, such that \D\ = 2l^l. (It need not be the 
case that L{D) = L{B).) One then constructs the "product" M (E) D, of the 
plCA, Af , with the deterministic Biichi automaton D. A key observation is that 
this "product" remains a plCA. In terms of RMCs, plCAs correspond to 1-box 
RMCs, and the "product" of a 1-box RMC with a deterministic BA, D, remains 
a 1-box RMC. 

It was shown in [12] that given a "product" (1-box) RMC M (E) D, it is possi- 
ble to construct a finite-state conditioned summary chain, A4' , which is a finite 
state Markov chain and whose transition probabilities are rational expressions 
in positive termination and non-termination probabilities of the (1-box) RMC. It 
is then possible to identify in P-time certain bottom strongly connected compo- 
nents T of Ai', such that the probability Pm{L{B)) is equal to the probability 
that starting from a specific initial state of A^', a run eventually hits a state in 

r. 

In this way, the model checking problem is boiled down to the problem of 
computing hitting probabilities in a finite-state Markov chain whose transition 
probabilities are simple rational expressions with numerators and denominators 
that are products of coefficients in a plCA together with positive termination 
and non-termination probabilities of a plCA. 

It is well known that non-zero hitting probabilities for a finite-state Markov 
chain are the unique solution {I — A)^^b, to a linear system of equations x = 
Ax -\- b, where the coefficients in A and b come from the transition probabilities 
of the Markov chain. The key remaining question is, how well- conditioned is this 
linear system of equations?. In other words, what happens to its unique solution 
if we only approximate the coefficients in A and b to within a small error? Now, 
the key is that applying Lemma [5] (which is from |10|). and applying the key 
result in [J, together shows that both positive termination and positive non- 
termination probabilities of the product plCA are bounded away from by 
1 /2Po'y(l*^M-D|) 



Under these conditions, exactly the same known condition number bounds 
from numerical analysis that were used in |10| namely Theorem 17 of |10| . which 
is a version of Theorem 2.1.2.3 of [TC], also establish that the linear system 
of equations that one has to solve for hitting probabilities in the conditioned 
summary chain M' derived from a plCA are "polynomially well-conditioned", 
meaning that approximating their non-zero coefficients within suitable 1/2^°''^ 
additive error yields a linear system of equations whose unique solution is e-close 
to the unique solution of the original system, for the chosen e > 0. We omit a 
detailed elaboration in this conference paper. □ 



B Proof of Theorem [T] 

Theorem [ij // x = P{^) o, quadratic MPS in n variables, with LFP q* > 0, 
and where P{x) has rational coefficients and total encoding size \P\ hits, then 

1. 2-l^l(2"-i), and 

2 q* < 22("+l)(l-P|+2("+l)log(2n+2))-5'' 

Proof. We first prove (1.), by lower bounding q*^)^^ in terms of the smallest con- 
stant Cmin in P{x)- 

Lemma 17. If x ^ P{^) has LFP q* > 0, and least term Cmin, then q*^^-^^ > 
min{l,Ci„in}^""^- 

Proof. We first observe that, since q* > 0, and there are n variables, it must 
be the case that P"(0) > 0. To see this, for any y > 0, let us use Z{y) to 
denote the set of zero coordinates of y. For any fc > 0, P'"'+'(0) > P'^(O), for 
all Z > 0, so Z{P^+\id)) C Z{P^{G)). Thus either \Z{P^+^{Q))\ = \Z(P^{Q))\ or 
|_^(pfe+i(0))| < |Z(P'=(0))| - 1. Now |Z(0)| = n and |Z(P'^(0))| > for all fc, so 
there must be some least < fc < n such that \Z{P^{Q))\ = \Z {P^+^ {{)))\ and 
such that Z{P^{G)) = Z{P^+^{{))). 

Note that, for any y > i), Z{P{y)) depends only on Z{y) and on P{x), but 
not on the specific values of non-zero coordinates of y. 

So if for some n>k>0, Z{P''+^{0)) = Z(P'=(0)) then, by a simple induction 
Z{P''+^{0)) = Z(P'^(0)) for all I > 0. So we must have Z(P'=(0)) = Z(P"(0)) = 
Z(P"+'(0)), for aU / > 0. Now lim„^oo ^""(0) = q*. Now if P"(0)j = 0, then 
P"+'(0)i = for all I > 0, and so q* = 0. This contradicts our assumption that 
q* > 0. So P"(0) > 0. 

Let us use P'^(0)@ to denote the minimum value of any non-zero coor- 
dinate of P^{0). Firstly, P(0) ^ 0, i.e., there is some non-zero constant in 
some polynomial P{x)i. Thus P(0)@ > Cmin. We show by induction that for 
fc > 0, P^{Q)@ > min{l, Cijiin}^''^^. This is true for fc = 0. We assume that 
P'^iO)® > min{l,c,„i„}2'°-i. If for some coordinated, P'^+HO)^ = P(P'=(0))j > 0, 
there must be a term in P{x)i which is not zero in P{P^'{Q))i, this is ei- 
ther a constant c, or a linear term cxj with P^{Q)j > 0, or a quadratic term 



cXjXi with P''{0)j > and P''{0)i > 0. In any of these 3 cases, this term 
is > Cmin niin{l, F'^(0)(a}^. Since P'^(0)@ > niin{l, Cmin}^'"^^, we now have 
that P''+^{0)@ > Cmi„(min{l,c,„i„}2'°-i)2 > min{l, Cminl^'^'^^ So for all k, 
P''{0)@ > min{l, Cmin}^'""^- In particular P"(0)(a > min{l, Cmin}^"~^- But 
P"(0) > so^P«(0),„i„ > min{l,cn,in}2"-^ We know q* > P"(0), so q*^,^ > 
min{l,Ci„i„}2"~^ □ 

To get our lower bound on q*-^^^ in terms of |P| and n, we just note that clearly 
Cmin > 2-1^1. This and Lemma [T71 give the bound q'^-^^ > 2-l^l(2"-i) in part (1.) 
of the Theorem. 

We now prove part (2.). To prove the upper bound on q*^^;^^, we need the 
following isolated root separation bound for systems of polynomial equations by 
Hansen et. al. P7]: 

Theorem 7. (Theorem 23 from flSf) Consider a polynomial system of equations 

{S) gi{xi,. . . ,Xn) = ■ ■ ■ = gm{xi,. ■ ■,Xn) ^ , (20) 

with polynomials of degree at most d and integer coefficients of magnitude at 
most 2^ . 

Then, the coordinates of any isolated (in Euclidean topology) real solutions of 
the system are real algebraic numbers of degree at most (2(i+l)", and their defin- 
ing polynomials have coefficients of magnitude at TTiost 2^"('^+''"'°s^'^™^'(^''+-^^" . 
Also, if^j — (7j.ij ■ ■ ■ ilj,n) is an isolated solution of {S), then for any i, either 

2-2n(r+2nlog(dm))(2d+l)"-i ^ l^^.^l 7j,, = . (21) 

Moreover, given coordinates of isolated solutions of two such systems, if they are 
not identical, they differ by at least 

sep(Z') > 2"^"(^+^"'°s(''"))(2'^+^)'"~'"^'°8i(") . (22) 

To apply Theorem [71 we now establish that q* is an isolated solution of an 
MPS with LFP q* > 0. 

Lemma 18. If x — P{x) is a quadratic MPS with LFP q* > 0, then q* is an 
isolated solution of the system of equations x — P{x). 

Proof. Firstly, we consider strongly connected MPSs. These can be divided into 
two cases, linear strongly-connected MPSs, where B{x) = P is a constant matrix 
and P{x) is affine, and nonlinear strongly-connected MPSs, where B{x) is not a 
constant matrix and P{x) is nonlinear. 

For the linear case, the Jacobian is a constant B{x) = B, and x — P{x) = 
Bx + P(0). We know that p{B{q*)) < 1 from Corollary H and thus since B = 
P(0) — B{q*), from Lemma [TUl we know that p(B) < 1, and thus (/ — B) is 
non-singular, and there is a unique solution to x = P{x) = Bx + P(0), namely 
q* = {I — B)~^P{0). Being unique, this solution is isolated. 



Now suppose, for contradiction, that x — P{x) is a non-linear strongly- 
connected quadratic MPS but that q* > is not an isolated solution to a; = P{x). 
Because q* is not isolated, there is another fixed-point q with — g||oo < <Zmin 
and q q* . Then q > and, since q* is the least non- negative fixed-point, 
q > q* . From Lemma [3] we have: 

P{q)-P{q*)^B{^{q*+q)){q-q*) 
Because q* and q are fixed points 

q-q*=B{l{q*+q)){q-q*) 

Lemma [TT] now yields that since q — q* > but q — q* and B{^{q* + q)) is 
irreducible, q > q*. Thus q — q* > is a positive eigenvector of the irreducible 
matrix B{^{q* + q)) associated with eigenvalue f , thus p{B{^{q* + q))) = f by 
Lemma [S] 

We now again invoke the assumption of non- isolation of q*, which implies 
there is a vector q' ^ q* such that q' ~ P{q') and \\q* — q'\\oa < min{g*jjj^, i(q — 
9*)min}- By the same reasoning as above, we have that q' > q* and p{B{^{q* + 
q'))) — 1. But now the condition — g'||oo < 5(9 — q*)min yields that q' < 
Q* + ^{q~q*) < (?• We thus also have that ^{q*+q) > ^{q* +q'), andheca,use B{x) 
is non-constant and monotone in a;, we have B{^{q* + q)) > B{\{q* + q')) and 
B{\{q*+q)) ^ B{\(q*+q')). However, p{B{\{q* + q))) = I = p{B(\[q* + q'))). 
This contradicts Lemma [71 So q* is also isolated in this case. 

This estabhshes that for aU strongly-connected MPSs, with LFP q* > 0, q* 
is isolated. 

Now suppose that x = P{x) is not strongly-connected. For each SCC S of 
X = P(x), the MPS 2:5 — Ps{xs,q}j(^g^) is strongly connected, so its LFP q^ 
is an isolated solution of xs = Psixs,qD{s)y That is, there is an £5 > such 
that if qs has ||gs - q*g\\ < eg and qs = Ps{qs, ^^(s))' tl^^n qs = q^. Now take 
e = minsjeg}. We claim that if \\q — q*\\oo < e and P{q) = q, then q = q* . We 
can show this by induction on the depth of strongly-connected components. If 
iS* is a bottom strongly-connected component, then qs has \\qs ^ qsWoo l£ ^ ^ £s 
and qs = Ps{qs)- So qs = qs- If S* is a SCC and for all variables D{S) that 
variables in S depend on, directly or indirectly, qD{s) — 9i5(s)j then qs has 

qs = Ps{qs,qD(s)) = Ps{qs,q*D{s))- ^^'i ll^s - q*s\\oo < e < es are 

enough to establish qs = qs- This completes the induction showing that q = q* . 
So q* is isolated solution for any MPS with LFP q* > 0. □ 

For each Xi, let di be the product of the denominators of all coefficients of 
P{x)i. Then dix — diP{x)i clearly has integer coefficients which are no larger 
than 2l^l. Suppose x = P{x) has LFP q* > 0, and suppose that coordinate k is 
the maximum coordinate of q*, i.e.. that = g^ax- Now consider the system of 
n + 1 polynomial equations, in n + 1 variables (with an additional variable y), 
given by: 



diXi ~ diP{x)i , and for all i G {1, . . . , n}; and Xk y = ^ ■ (23) 

Lemma [TSl tells us that g* > is an isolated solution of a; = P{x). If z G M" 
is any solution vector for x = P{x), there is a unique w G K such that x := z 
and y := w forms a solution to the equations (j23p : namely let w — j^. So, letting 
X :— q* , and letting 2/ ^ for ^-U *i gives us an isolated solution of the equations 
We can now apply Theorem [7] to the system For y = equation 

(|2T1) in Theorem [7] says that 

2-2(n+l)(|P|+2(n+l)log(2n+2))5" ^ J_ — =0 

Since q* > 0. clearly ^ =^ 0, so ^ = ^ > 2-2(n+i)(|PI+2(n+i) iog(2n+2))5" ^ go 

9max < 2^("+^)^l^l+^^"+^)'°8^(^"+^^^^". (24) 

□ 



B.l How good are our upper bounds for R-DNM on MPSs? 

We prove in this paper upper bounds on the number of iterations required by 
R-DNM to converge to within additive error e > of the LFP q* for an arbitrary 
MPS X = P{x). 

We now discuss some important parameters of the problem in which our 
upper bounds can not be improved substantially. 

To begin with, our upper bounds for the number of iterations required contain 
a term of the form 2'^log-^. Here d denotes the nesting depth of SCCs in the 
dependency graph Gp of the input MPS, x = P{x). 

It was already pointed out in [7J (Section 7) that such a term is a lower bound 
using Newton's method on MPSs, even for exact Newton's method (whether 
decomposed or not), even for rather simple MPSs. [7] provided a family of simple 
examples entailing the lower bound. Indeed, consider the following MPS, x = 
P{x), which is a simpler variant of bad MPSs noted in [7]. The MPS has n + 1 
variables, xq, . . . , x„. The equations are: 

= '^^i + ^^i-i : for alH e {1, . . . , n} 

1 2 1 

^0=2-0+2 

The LFP of this MPS is g* = 1, and it captures the termination probabilities 
of a (rather simple) stochastic context-free grammar, pBPA, or 1-exit Recursive 
Markov chain. Note that the encoding size of this MPS is \P\ — 0{n). 

As observed in [11], exact Newton's method, starting from x'^^^ :— 0, on the 
univariate equation xq = ^Xq + ^ gains exactly one bit of precision per iteration. 
In other words, if a;^'^^ denotes the /c'th iterate, then 1 — a;''^^ — 2~^. 



Suppose we perforin m iterations of exact NM on the bottom SCC, xq = 
^Xq + ^, and suppose that by doing so we obtain an appoximation q'q — 1 — aQ, 
where ao = 2^™. Plugging the approximation q'q into the next higher SCC, the 
equation for xi becomes xi — ^x\ + ^gg. For the rest of the argument we do not 
need to appeal to Newton iterations: even exact computation of the LFPs for 
the remaining SCCs will yield bad approximations overall unless 1 — <Zo — 5^ 
(showing that the system of equations is terribly ill-conditioned) . 

Indeed, by induction on i > 0, suppose that the value obtained for LFP of Xi 
is q[ = {\ — Oi). Then after plugging in q[ in place of x^ in the SCC for x^+i, the 
adjusted LFP, q[^^, of the next higher SCC: x,+i = {l/2){xif + (1/2)(1 - a^), 
becomes q[j^i = 1 — ^/al- Thus, by induction on depth, the adjusted LFP of a;„ 
becomes q'n = l- ag"". But ao = 2-™. Thus g^^ = 1 - 2-™2-"_ 

We would like to have error 1 — q'^ = 2"™^ " < e. Taking logs, we get that 
we must perform at least m > 2" log ^ NM iterations on the bottom SCC alone. 

Note that n here is also the depth d of SCCs in this example. 

Other terms in our upper bounds on the number of iterations required to 
compute the LFP of a general MPS are log and \ogq*^^^. Simple "repeated 

squaring" MPSs, with Xi = xf_i, xq — or 2}, show that we can have q'^^^ < 
■^TT, and qj^ax ^ 2^ , where n is the number of variables. In Theorem [1] we give 
explicit lower bounds on q'^i^^ and explicit upper bounds on q^^^, in terms of 
\P\ and n, showing that linear-double-exponential dependence on n is indeed the 
worst case possible. 

However, it should be noted that the worst-case bounds on g^jj^ and gj^ax 
are not representative of many important families of MPSs. In particular, note 
that MPSs corresponding to termination probabilities must have g^ax — 1- Fur- 
thermore, for a number of classes of probabilistic systems we can prove bounds 
of the form log < poly{\P\). Indeed, for MPSs corresponding to QBDs and 
probabilistic 1-counter automata, such a bound was proved in [10]. 

If the family of MPSs happens to have log , log g^ax — Poly{\P\), then 
our upper bounds show that the total number of iterations of R-DNM needed is 
only exponential in d, the depth of SCCs, and thus if d < log \P\, then for such 
MPSs R-DNM runs in P-time in the encoding size of the input, \P\ and logi, 
in the standard Turing model of computation, to compute an approximation to 
the LFP g*, within additive error e > 0. 

It should be noted that for the case of strongly connected MPSs only, and only 
for Exact Newton's Method, without rounding, [7J obtained comparable result to 
ours in terms of worst-case dependence on log and logg,"JjaxEI However, in 
[7] they did not obtain any constructive bounds in terms of \P\, q^-^^ or g*jax 
for MPSs that are not strongly connected, nor did they obtain any results for 

* Technically, their bounds are with respect to relative error, and their bounds for 
strongly connected MPSs do not depend at all on Qmaxi but of course if g^ax is large, 
then in order to obtain absolute (additive) error e > 0, the relative error required 
is e' — — — , and since their bounds depend on log ^ they depend (indirectly) on 
logQmaxi with the same magnitude as ours. 



rounded versions of Newton's method. Using exact Newton's method of course 
entails the assumption of a unit-cost arithmetic model of computation, rather 
than the Turing model. 



